An R package for analyzing and modeling ranking data
Abstract
BACKGROUND: In medical informatics, psychology, market research, and many other fields, researchers often need to analyze and model ranking data. However, there is no statistical software that provides tools for the comprehensive analysis of ranking data. Here, we present pmr, an R package that bundles tools for analyzing and modeling ranking data. The pmr package provides descriptive statistics (mean rank, pairwise frequencies, and marginal matrix), Analytic Hierarchy Process models (with Saaty's and Koczkodaj's inconsistencies), probability models (Luce model, distance-based model, and rank-ordered logit model), and visualization of ranking data with multidimensional preference analysis.
RESULTS: Examples of the use of the pmr package are given using a real ranking dataset from medical informatics, in which 566 Hong Kong physicians ranked their top five of seven incentives to the computerization of clinical practice (1: competitive pressures; 2: increased savings; 3: government regulation; 4: improved efficiency; 5: improved quality care; 6: patient demand; 7: financial incentives). The mean ranks showed that item 4 is the most preferred and item 3 the least preferred, and significant differences were found between physicians' preferences with respect to their monthly income. A multidimensional preference analysis identified two dimensions that together explain 42% of the total variance. The first dimension can be interpreted as the overall preference for the seven items (labeled "internal/external"), and the second as their overall variance (labeled "push/pull factors"). Various statistical models were fitted, and the best-fitting ones were weighted distance-based models with Spearman's footrule distance.
CONCLUSIONS: In this paper, we presented the R package pmr, the first package for analyzing and modeling ranking data. The package gives users insight through descriptive statistics of ranking data. Users can also visualize ranking data by applying multidimensional preference analysis. Various probability models for ranking data are also included, allowing users to choose the one most suitable to their specific situation.
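To make the descriptive statistics named in the abstract concrete, the following is a minimal Python sketch of the mean rank, the pairwise frequency matrix, the marginal matrix, and Spearman's footrule distance. It is not the pmr API (pmr is an R package); the function names and the three-item toy data are hypothetical and chosen only for illustration. The sketch also assumes complete rankings, whereas the physicians' dataset described above consists of top-five partial rankings.

```python
# Illustrative sketch only: hypothetical helper names, NOT the pmr package API.
# Each ranking is a list where position i holds the rank given to item i
# (1 = most preferred). Complete rankings are assumed.

def mean_rank(rankings):
    """Average rank received by each item across all respondents."""
    n = len(rankings)
    k = len(rankings[0])
    return [sum(r[i] for r in rankings) / n for i in range(k)]

def pairwise_frequencies(rankings):
    """pair[i][j] = number of respondents who rank item i ahead of item j."""
    k = len(rankings[0])
    pair = [[0] * k for _ in range(k)]
    for r in rankings:
        for i in range(k):
            for j in range(k):
                if i != j and r[i] < r[j]:
                    pair[i][j] += 1
    return pair

def marginal_matrix(rankings):
    """mar[i][p] = number of respondents who give item i the rank p + 1."""
    k = len(rankings[0])
    mar = [[0] * k for _ in range(k)]
    for r in rankings:
        for item, rank in enumerate(r):
            mar[item][rank - 1] += 1
    return mar

def footrule_distance(r1, r2):
    """Spearman's footrule: sum of absolute rank differences per item."""
    return sum(abs(a - b) for a, b in zip(r1, r2))

# Hypothetical toy data: four respondents ranking three items.
toy_rankings = [
    [1, 2, 3],
    [1, 3, 2],
    [2, 1, 3],
    [1, 2, 3],
]

if __name__ == "__main__":
    print("mean rank:", mean_rank(toy_rankings))
    print("pairwise frequencies:", pairwise_frequencies(toy_rankings))
    print("marginal matrix:", marginal_matrix(toy_rankings))
    print("footrule:", footrule_distance(toy_rankings[0], toy_rankings[2]))
```

In this toy example the first item has the lowest mean rank and is therefore the most preferred, which mirrors how the abstract reads item 4 as most preferred from the mean ranks.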
Similar resources
A modified branch and bound algorithm for a vague flow-shop scheduling problem
Uncertainty plays a significant role in modeling and optimization of real world systems. Among uncertain approaches, fuzziness describes impreciseness while for ambiguity another definition is required. Vagueness is a probabilistic model of uncertainty being helpful to include ambiguity into modeling different processes especially in industrial systems. In this paper, a vague set based on dista...
Rankcluster: An R Package for clustering multivariate partial ranking
Rankcluster is the first R package dedicated to ranking data. This package proposes modelling and clustering tools for ranking data, potentially multivariate and partial. Ranking data are modelled by the Insertion Sorting Rank (isr) model, which is a meaningful model parametrized by a central ranking and a dispersion parameter. A conditional independence assumption allows to take into account m...
seqMeta: an R Package for meta-analyzing region-based tests of rare DNA variants
Region-based tests are becoming a popular tool for analyzing rare genetic variants. In order for these tests to have adequate power, it is often necessary to meta-analyze information from multiple contributing studies, where consent restrictions make it difficult or impossible to share individual level data. We present the R package seqMeta for meta-analyzing region based tests, such as SKAT, S...
Volume 13
Publication date: 2013